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Abstract 

The method of extrapolating asymptotic series, based on the Self-Similar 
Approximation Theory, is developed. Several important questions are answered, 
which makes the foundation of the method unambiguous and its application 
straightforward. It is shown how the extrapolation of asymptotic series can be 
reformulated as forecasting for time series. The probability measure is introduced 
characterizing the ensemble of forecasted scenarios. The way of choosing the 
complete family of data bases is advanced. 
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1 Introduction 



The problem of reconstructing functions from their asymptotic series is constantly met 
in all branches of physics and applied mathematics [1]. Not less ubiquitous is the 
problem of forecasting for time series [2] . Here we show that these two problems have 
much in common and can be solved in the same way. The solution stems from the 
Self-Similar Approximation Theory [3-13]. The basics of the particular techniques to 
be considered here have been suggested earlier and the feasibility of applying them 
to analyzing time series has been discussed [14,15]. However several important points 
remained unclear. The aim of the present paper is to develop further the theory, so 
that to remove ambiguities left, to provide a firm mathematical foundation, and to 
advance some fresh ideas permitting one to extend the applicability of the method. 
More concretely, 1 aim at providing answers to the following questions: 

(1) What presentation of asymptotic series is the most correct and general for their 
extrapolation involving self-similar exponential approximants? 

(2) What is the result of the fractal transformation, that is, of the multiplicative 
power-law transformation? 

(3) What should be the power of the fractal transformation? 

(4) How to define control functions characterizing the effective time of motion with 
respect to the approximation number? 

(5) What is the correct presentation of time series, as a forward or backward recur- 
sion, in order that forecasting for time series could be formulated as extrapolation of 
asymptotic series? 

(6) How to select among several scenarios forecasted by using different data bases? 

(7) How to construct a complete family of data bases, that is, how many terms 
from the past-history data base are to be taken and how to vary the related past-time 
scales? 

2 Extrapolation and Forecasting 

Let us start with the problem of extrapolating asymptotic scries. Assume, for simplic- 
ity, that the sought function f{x) is real and depends on a real variable x. Suppose 
that all we know is the asymptotic behaviour of this function, f{x) ~ fki^), in the 
vicinity of x = 0. The index /c = 0, 1, 2, . . . here enumerates approximations. The most 
common form of such asymptotic approximants is the power series 

k 

h{x) = E «n X"" , (1) 

n=0 

in which Oq 7^ and the powers q;„ are arranged in the ascending order 

an < an+i (n = 0, 1, . . . , A;) . 
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One usually has = n, but, for generality, we may imply that the powers are 
arbitrary real numbers, that even can be negative, provided the ascending order is 
retained. 

The problem of extrapolation of asymptotic series sounds as follows: How to define 
the function f{x) at finite x from the knowledge of its asymptotic expansions (1) valid 
only in the asymptotic vicinity of a; = 0? 

In order that mathematical manipulations be independent from the choice of phys- 
ical units which the sought function is measured in, it is more appropriate to deal with 
a dimensionless function. This can be easily achieved by factorizing out the zero term 
fo{x) — aox°'°, with ao 7^ 0, and introducing the dimensionless function 

This point is rather trivial and well understood in physics, where one always prefers to 
deal with scale-invariant quantities. 

Clearly, the variable x is also to be dimensionless. When the variable pertains to a 
finite interval, it always can be scaled so that to be in the unitary interval 

< a; < 1 . (3) 

This requirement will be assumed in what follows. And it will be shown that condition 
(3) is not merely nontrivial but principal. If the initial physical variable is given on an 
infinite interval, one can change the variables so that the new variable be defined on a 
finite interval, and then, by the appropriate scaling, one can reduce the latter interval 
to the unitary one. 

Thus, we consider the asymptotic series 

k 

Vk{x) = Y. bn X'^'^ , (4) 
n=0 



in which 



bn = — , l3n = 0!n-ao>Po = 

ao 



The sequence {</?fe(x)}^o has the zero radius of convergence, being divergent for any 
finite X. It converges solely at one point a; = 0. How it would be possible to extrapolate 
the asymptotic expansion (4), having sense only for a; — > 0, to the finite interval (3)? 

Such an extrapolation can be done by employing the Self-Similar Approximation 
Theory [3-13]. The main idea of the latter is to present a relation between subsequent 
approximants as the motion with respect to an effective time whose role is played by 
the approximation number k. However, it is meaningless to look for a relation between 
the terms of a divergent sequence. Hence, the first thing one has to do is to transform 
the divergent sequence {ipk{x)} to a convergent form, which can be accomplished by 
a transformation involving control functions [3]. The name of the latter comes from 
their role in controlling convergence. In the case of the power series (4), a convenient 
transformation is the multiplicative power-law transformation [10-12], which is simply 
the multiplication of (pk{x) by a power-law factor a;*, where s is a control function. 
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Since the power laws are generic for fractals [16], this kind of multiplication can be 
termed the fractal transformation. In this way, we introduce the fractal transform 



^kix,s) = x^ipkix) , (5) 

whose inverse is 

(Pk{x) = x'''^k{x,s) . (6) 
The resulting fractal transform is 

M^,s) = j2 bnx'^^-. (7) 

n=0 

This can be considered as asymptotic series, for all \x\ < 1, provided that 

s ^ oo . (8) 

The value of expansion (7) at x = 1 can always be defined as the limit from the 
left. Thus the sequence {^k{x, s)} can be treated as convergent on the whole unitary 
interval (3) under condition (8). 

For a convergent sequence, it becomes meaningful to try to find a relation between 
subsequent terms. Such a relation, according to the Self-Similar Approximation Theory 
[3-9], can be formulated as the motion with respect to the approximation number k. In 
mathematical parlance, the motion can be expressed as the semigroup property for a 
family of endomorphisms. To this end, we define the function x{(f, s) by the equation 



Then, we introduce 



^o{x,s) = ip, x^x{ip,s). (9) 



yk{f,s) = ^k{x{(p,s),s), (10) 



which is an endomorphism of R. The semigroup property for an endomorphism yk 
reads as yk+p = yk ■ Up- This is equivalent to the property of group self- similarity 

Vk+pi^P, s) = ykiVpi^P, s),s) . (11) 

The family {yk\ k = 0,1,2,...} of the endomorphisms yk, with the semigroup 
property (11), forms a cascade, that is, a dynamical system in discrete time k. Then 
Eq. (11) is the evolution equation of the cascade, with the initial condition 

yo{^) = ^ , (12) 

which results from Eqs. (9) and (10). The functional equation (11) does not uniquely 
define a solution yk{^)- Among the wide class of solutions of Eq. (11) there are, in 
particular, the power- law solutions yk{'P, s) = (^"fc^*) provided that their powers satisfy 
the condition ak+p{s) — ak{s) + ap{s). But the power-law expressions form only a 
narrow class of possible solutions of Eq. (11). Hence the group self-similarity (11) is 
essentially more general property than the simple geometric self-similarity resulting in 
power-law solutions [16]. 
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To find a solution of tlie functional equation (11), corresponding to a fixed point, we 
can proceed as follows [4-9]. Let us embed the cascade {yk\k = 0, 1, 2, . . .} into a flow 
{Vrl T ^ 0}, which implies that the latter satisfies the same self-similarity relation (11) 
with the initial condition (12). This group relation for the flow yields the Lie equation, 
which is the differential equation with respect to the continuous effective time r. The 
differential equation may be integrated from the effective time r = n to the time n + t„ 
that is necessary for reaching a quasifixed point y* which is an approximate fixed point. 
The resulting evolution integral is 



dip 



where 



(13) 



Vniv, S) = Vniv, S) - Vn-liv, s) = &„ cp^+f^r./s (^4) 

is the cascade velocity. Prom the evolution integral (13), we get the quasifixed point 
y* — y^{(p,s), which, according to Eqs. (9) and (10), defines 

mx,s)^yl{x\s). (15) 

Then we have to realize the inverse fractal transformation (6), with the limit (8), which 
gives 

iplix) = limx-' ^l{x,s) . (16) 

This procedure is to be accomplished so many times as necessary for reorganizing 
all series entering (pl{x). Such a repeated procedure has been called [11] self-similar 
bootstrap. Here we suggest the fastest way of accomplishing this bootstrap, when 
the /c-order series (fik{x) requires just k iterations. To this end, we may present the 
asymptotic expansion (4) as 

(Pk{x) = l + xi, (17) 
with the sequence of the iterative terms 

b b 

Xi = X^^-^° (1 +X2) , X2^ ^ X^^-^^ (1 + 2:3) , 

x„ = T^x^"-'^"-(l + x„+0, x, = ^x^^-P''-^ . (18) 

Then we may consider each expression l-\-Xn as asymptotic series with respect to small 
— > 0. The self-similar renormalization described above results in transforming l-\-Xn 
to exp(a;„T„). Starting with the form (17), we accomplish k steps of this self-similar 
interactive procedure. Then, with the notation 



where n = 1, 2, . . . , A;, we come to the exponential self-similar approximant 

(flix) = exp {cix"^ exp {c2x'''^ . . . exp (cfca;"*)) . . .) . (20) 
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What is yet left undefined in the approximant (20) is the effective times, r„, which 
play the role of control functions. The latter may be determined from fixed-point 
conditions [3,12]. Here we advance a more general way of defining control functions r„. 
This way is common for optimal control theory, where control functions are defined from 
the minimization of a cost functional. To construct the latter, one has to formulate 
the requirements imposed on the sought control functions. In our case, wc would 
like to reach a quasifixed point as fast as possible. The minimal effective time for 
this corresponds to one step of the iterative procedure. If we reach an answer in n 
steps, then the minimal effective time should be close to 1/n. Another requirement 
characteristic of the motion near a quasifixed point is the smallness of the distance VnTn 
passed at the n-th step. This suggests to construct the time- distance cost functional 



2 ^ 

n 



Tn + {VnTn) 

nj 



Minimizing F with respect to r„ yields 

1 



(21) 



(22) 



Here the ^-representation of the cascade velocity Vn{x) = x~^Vn{x^ i s) results in 

Vn{,x) = (pn{,x) - (pn-i{.x) = . (23) 

Summarizing, for expressions (19) we have 

^n{x) — 'Tn{x) i Oin—1 i 

rn(x) = \ , vr^ix) = ^ . (24) 

n[l + v^{x)\ ao 

The functions c„(x), being proportional to the control functions Tn{x), can be called 
controllers. 

Finally, for the asymptotic series (1), we obtain the self-similar extrapolation 

fl{x) = fo{x) exp {cix"^ exp {c2x'''' . . . exp {ckx"'')) . . .) , (25) 

where c„ = c„(a;) and z/„ are given in Eq. (24), and which extrapolates the series (1), 
valid only for x — 0, to the whole interval (3). 

Now we shall show how the extrapolation of asymptotic series can be reformulated 
as forecasting for time series. This reformulation, as will be evident from what follows, 
requires that the considered time series be presented as a backward recursion, with the 
related moments of time being arranged in the backward order 

tn+i<tn (n = 0,l,2,...,A;) . (26) 

The time series for a quantity of interest is an ordered set {/«} of the values /„ of 
this quantity, measured at the corresponding times tn- The past time horizon is the 
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time interval [t^, 0] when the series is observed. The present time is chosen to be at 
^0 — 0. Let the considered time series be described by the past-history data base 

Bk = {fk, fk-i, ...,fo\tk< tk-i < . . . < 0} . (27) 

The problem to be attacked is how, on the grounds of the given data, to predict the 
behaviour of the time series in future times t > 07 

In order to employ the technique developed for asymptotic series, we need, first, 
to construct a sequence of functions approximating the behaviour of the time series at 
asymptotically small time t ^ 0. Similarly to the case of asymptotic series, we have to 
work with the quantities reduced to the scale-invariant dimensionless form. And the 
time variable is to be normalized with respect to the length of the prediction horizon 
so that this be a unitary interval 

0<t<l. (28) 

For the given data base (27), one may construct an interpolating function fk{t) 
such that 

fk{tn) = fn (n = 0,l,...,A;) . (29) 

This interpolation can be uniquely realized e.g. by means of the Lagrange interpolation 
formula [17] as the sum 

k 

fkit) = E fn m (k > 1) (30) 

n=0 

over the Lagrange polynomials 

n 1^ {n<k). 

Formula (30) can be rewritten as the power series 

/fc(t) = i:a„r. (31) 

n=0 

The coefficients depend on the given data base (27), but an additional index k is 
not explicitly shown here for brevity. Expressions (30) or (31), by construction, satisfy 
Eq. (29), including the moment of time tQ — 0, where 

A(0) = /o . (32) 

Therefore, the interpolative formula (31) can be considered as an asymptotic series 
with respect to i — > -|-0. 

In the same way as asymptotic series, expression (31) has no sense for finite t > 0, 
that is, cannot be directly employed for predicting future. But, being just a particular 
kind of asymptotic series, Eq. (31) can be extrapolated to the whole interval (28) 
following the general method of self-similar extrapolation. Then, analogously to Eq. 
(25), we obtain the self- similar forecast 

fkit) = fo exp {cit exp {c2t . . . exp (c^t)) . . .) , (33) 



7 



where c„ = Cn{t) are the controllers 

Cn{t) = Tn{t) {n=l,2,...,k) , 

^nit) = ^7-^ V^{t) = ^ r . (34) 

In this way, for each given past-history data base (27), we may construct the forecast 
(33) vahd for the future time horizon (28). Since the coefficients a„ = ank depend on the 
choice of the data base, according to the conditions (29), the controllers c„(t) = Cnkit) 
also depend on the data base. This means that, for another data base, we shall obtain 
another value of the forecast. There are two things that may be varied for data bases: 
the number of points k and the set of the time moments. Denoting a particular 
choice of the latter set with the index j, we have {t!^^}- Thus, each data base has to 
be labelled by two indices 

^ {fi'\ fl'li. • • • , /ol < < . . . < 0} . (35) 

Varying these indices produces a family {©^(j)} of the data bases and, respectively, 
results in an ensemble {/^ (j, t)} of the self-similar forecasts. How could we classify this 
ensemble of possible scenarios? 

The problem of classifying scenarios in forecasting is analogous to the problem of 
pattern selection in the case of nonunique solutions of nonlinear partial differential 
equations [18]. A probabilistic approach to the problem of pattern selection has been 
recently advanced [19], which can also be used for defining a probability measure for 
the ensemble of forecasted scenarios. Following the approach [19], we treat the map 
{/fe(j) t)| = 0, 1, 2, . . .} as the image of a dynamical cascade, where k plays the role 
of discrete time, and for the probabihty of a forecast (j, t) we have 

Pk{3, t) = ^ exp {-A,Sfe(j, t)} , (36) 

where Zk{t) is a normalization factor and ASk{j,t) is an entropy variation. Let us 
introduce the mapping multiplier 

and the average multiplier rf%k{t), such that 

= E 1 — r-^ ■ (38) 

\rnk{t)\ Y \mk{j,t)\ ^ ' 

Then the scenario probability (36) can be reduced [19] to the form 



Pk{j,t) 



mkjt) 
mk{j,t) 



(39) 
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The most probable scenario is, by definition, that one corresponding to the maximal 
probability (39). 

It may happen that the data-base variation is limited by the information available. 
Then there is no choice as to deal with the given family {©^(j)}. But if this variation 
can be done in a very wide range, the question remains as when should one stop varying 
data bases? A brief answer to this question is: Vary data bases until reaching numerical 
convergence. More precisely, the variation procedure can be realized as follows. Let us 
fix a time scale, labelled by j, and increase the data-base order k, that is the number 
of points from the past history. The increase of k is to be stopped at the saturation 
number Nj, after which the forecasts f^{j,t), for k > Nj, practically (within a given 
accuracy) do not change. Then the forecasts (33) as well as the scenario probability 
(39) become dependent only on the time scale, labelled by j, 

ru,t) = ,%fkij,t) > pM = • (40) 

Varying the data-base time scale, it looks convenient to deal with equidistant steps 
Aj = t^'' — where n = 0,1, . . . , Nj, although the equal distance between the time 
moments is, in general, not obligatory. It is natural to start with the step Aj = 1, 
coinciding with the prediction horizon. Then we may increase as well as decrease 
the time grid, which can be done in different ways, say by following the law /S.2j — 

2~^, A2j+i = 2-^, with j — 0, 1,2, Again, the further variation of the time mesh 

is to be stopped at j = j^ax as soon as we reach numerical convergence. In this way, 
the complete data fam,ily is a set {DAr^ (j)| j = 0, 1, 2, . . . ,jmax} corresponding to the 
ensemble {f*{j,t)\j = 0, 1, . . . , jmax} that can be called the scenario spectrum. As 
far as this ensemble of forecasts is weighted with the probability measure p{j,t), it is 
straightforward to define the expected forecast, the dispersion, and other quantities 
typical of probabilistic ensembles. 

3 Conclusion 

The possibility of forecasting the main trends of time series is of great importance and 
can have various applications. One of such exciting application would be with respect 
to economics and financial time series. The behaviour of markets is nowadays studied 
by different methods imported from physics [20]. However the problem of quantitative 
prediction for markets remains yet a puzzle. I would think that the novel approach, 
developed in this paper, is a practical step for solving the problem of forecasting for 
time series, including financial time series. The aim of this communication has been 
to present a mathematical foundation for the approach, whose applications will be 
considered in separate publications. The content of this paper answers the questions 
posed in the Introduction, and these answers are: 

(1) The most general and correct presentation of asymptotic series, appropriate for 
the self-similar extrapolation is in a dimensionless scale-invariant form, but with the 
domain of a variable reduced to the unitary interval. 
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(2) The fractal transformation extends the region of vahdity of the initial asymptotic 
series. 

(3) The power of the fractal transformation must asymptotically tend to infinity in 
order to yield series with the unitary radius of convergence. 

(4) The effective time of motion with respect to the approximation number is to be 
defined from the minimum of a cost functional. 

(5) In order that extrapolation of asymptotic series could be correctly reformulated 
as forecasting for time series, the latter are to be presented as a backward recursion. 

(6) All forecasted scenarios, derived by employing different data bases, compose a 
statistical ensemble equipped with a probability measure characterizing the probabili- 
ties of particular patterns. 

(7) The complete data family is a set of data bases, with time scales varying in the 
range limited by the saturation of calculations, and with the numbers of past-history 
terms sufficient for providing numerical convergence for each given time grid. 
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